Pink Rabbit part 1
===


### General Questions ###


##### GQ1 #####


investigator: Okay, recording in progress. So, now that we are at section "general questions regarding usage of digital libraries", you should see the first question, which is which task do user usually use the digital library for. Please tick all answers, which apply and complete your own tasks. Please give oral examples of the tasks you're ticking.

So, for example, if you would use digital libraries for "person search". You could say, okay, I tick "person search" because I want to keep track of myself. So, yeah.

Pink Rabbit: Okay. So, first I will tick "person search" for two reasons, mainly for testing if I develop any feature in DBLP. Second, when I write an academic paper and I see that one person did a lot of writing in the specific subject, which I'm covering, I will look at the person and look at their output to understand if they have maybe even more interesting papers for me. Because of the same reason, I will also tick the second one, the "paper search" because I'm looking often for papers to understand how the topic is where I'm working in.

And if there are any interesting conclusions which I could use. Okay. I'm not ticking "venue search". Question, "venue search": if there's a specific conference does this take count as venue?

investigator: If you search for the specific conference, then it would be "venue search". 

Pink Rabbit: Okay. Thank you. Then I do tick " venue search" because the reason if I am working in a specific field scientifically, I will look at interesting conferences which cover this field to understand if there are any papers that could be relevant for me.

I'm also ticking BibTeX data. because I write scientific papers in LaTeX. "Get full text papers" I assume this means getting access to like the whole paper and being able to read it, yeah. Then I tick this one too. Because like I did most of the searches, which I mentioned before to get access to interesting papers.

And I will also tick the last one "study relations". If I notice that one author, which is relevant for my work has like also the same co-author I will also look for the co-author to understand if they have maybe individually written other papers, but it could be relevant for me.

investigator: Are there any other tasks which have not been mentioned here that you would want to also include under "Sonstiges"? 

Pink Rabbit: I have to think about this.

Yes. I would also use digital libraries to understand how often papers are cited. For example, to understand if they're relevant for the field or if they're outliers. I will write citations to "Sonstiges".

investigator: Okay. 

Pink Rabbit: I think that's it. 


##### GQ2 #####


investigator: Okay, perfect. So, then let's continue to the next question. Which systems or digital library do you usually use to solve these tasks? Please tick all answers, which apply and name others which also apply, but have not been written here and please give a short oral description, why you like, or why you use the system.

Pink Rabbit: First I will tick DBLP cause I'm most close to this one. I know how it works. And thus, I know what to expect. Also, I'm working in the computer science field thus, DBLP is an amazing resource for this. I'm also using Google scholar, because it gives like an easier access to a good search. with DBLP, the search functionality is a bit limited. With Google scholar it's very flexible in what you can search for. Okay. I'm also ticking "usual search engines", Google. Because sometimes it's just nice to get access to a blog article et cetera, which could then link forward to scientific work.

I am not clicking any of the other libraries. I also for "Other" I would also add "Zenodo" which is a repository for data sets which I use quite often for my current project in DBLP. And I'm using this one, because it's quite broad has good metadata. 

investigator: Okay, perfect. So, if you do not want to add anything else, we can continue to the next page. 

Pink Rabbit: I press continue. 


### TASK 1 ###


investigator: Okay, perfect. So, now you see task one. Consider the following task, find two experts on a topic of your liking. Example topics could be "domain specific query languages" or "hashing functions" but should be from the broader area of computer and information science.


##### TASK 1.1 #####


So, you can choose your own topic, whatever you like. And you do not really have to do this task, but we are going to talk about how you would try to solve this task. So, the first question would be what is your chosen topic? 

Pink Rabbit: Let's go with "implementation of database systems".


##### TASK 1.2 #####


investigator: Okay, great. and how familiar are you with this topic?

Pink Rabbit: I would say I am not an expert, but fairly familiar with it. I worked in the topic of data engineering for several years and I listened to university course about database implementation. 


##### TASK 1.3 #####


investigator: Okay. That's great. So, then let's continue to question a three. How would you define an expert?

Pink Rabbit: I would define an expert as a person who has the theoretical and practical knowledge to implement a database more or less on scratch or a person who has a very specific knowledge of one part of implementing database systems. Very important here is theoretical and practical knowledge.

So, I would not necessarily see a person as an expert who only can like write the code but doesn't know the concept and other way around. 

investigator: Okay, great. Do you want to add anything else to this definition or is this good and we should continue with the next question?

Pink Rabbit: I'm happy with the definition. 

investigator: Okay, perfect.


##### TASK 1.4 #####


So now that you have your chosen topic and you know what an expert is, how would you solve the task of finding two experts for your chosen topic? 

Pink Rabbit: I would use several steps. I think first I would use Google and Google scholar to get a broad understanding what kind of books and or articles and or blog posts would come up first if I search for the chosen topic.

investigator: Okay. in which order would you do this, or would you do this in parallel and how exactly would you do this?

Pink Rabbit: Okay. First, I would go to Google and just plainly enter "implementation database systems" and look at the first few results to understand if there's something interesting. I would assume that there will be blog posts or academic websites.

And I would go through the first few results to understand if there's something that looks interesting or relevant for me, or if there are any keywords, I could use to search more specific for this topic, because sometimes we need very specific keywords to actually get their interesting results in digital libraries.

investigator: Okay. How would you get those keywords or how do you identify these keywords? Or where do you then identify them from? 

Pink Rabbit: I would read or skim the page we just opened in Google and see if there’s any pattern, like any specific phrasing all of them use. I would prioritize here articles which seem to be written by someone with an academic background. Yeah. 

investigator: Okay. How do you find information if a person has an academic background? 

Pink Rabbit: I would look at the page. So, as a URL, is this page hosted by university or is it hosted by just a random blog post company? I would also look at the name to understand if they have like a PhD already written there. I might also search for the name of the authors depending on the background I would use either DBLP or Google scholar. to understand if I can get more information about the background of the person, of the people. 

investigator: Okay, great. And then how would you continue if you do have the keywords now?

Pink Rabbit: Okay. So, let's assume because of this research, I already have like some keywords and I have names of some people who work in this field which are academic with a background at least. I would then go to Google scholar search again for the names of these people to understand if I can see further articles or books they published. I would also open Google scholar and just search for the keywords, which I determined. I would skim the names of the articles to see if there's something interesting. I would also look at the number of citations here, the total number of citations wouldn't be that important for me. More important would be for me the relative number compared to the other entries, which I see on Google scholar. So, if I see one paper has 50 citations, but everything else has zero or like 10 citations then I would guess that the article with 50 citations would be quite relevant for the field. 

investigator: Okay. And so how do you choose the two experts? 

Pink Rabbit: I would look at... So, I would now have like several articles which seem to be very relevant to my field, to my specific topic. then I would look at all of the authors of the articles to understand who has like the biggest output. I would look at the places where they're published.

So, if they mainly published in conferences, which are database-focused, or if they mainly publish somewhere else and only have like one paper in the database world. and from there, I would select the two authors who have like the most relevant articles published in relevant conferences and journals. 

investigator: Okay. That's great. Do you have anything to add to this question? Or else, should we continue to the next page?

Pink Rabbit: Maybe one thing to add. At this point I would also have like a list of conferences, which would be relevant for the topic. So, I would probably also skim the papers to latest papers of a few of the conferences to see if there's any name, which comes up very often.

And potentially add one or two people from there to my list of potential experts and repeat the previous process. 

investigator: Okay. where do you get the papers of the conferences from? Do you access the specific webpage or digital library? 

Pink Rabbit: My first try would be to use DBLP. because quite often they have good links and because I'm working out of the university, I have access to most of them or a lot of them, at least. If I cannot find them there or cannot find open access to them, I would use Google scholar to try my luck there.

investigator: Okay. So, anything else, or should we continue with the next task? 

Pink Rabbit: Let's continue. 


### TASK 2 ###


investigator: Okay. Then you can go to the next page where we see task two. Consider the following task, find relevant papers from a topic of your liking, which appeared after 2017. Example topics could be "paper recommendation" or "author disambiguation", but should be from a broader area of computer and information science.


##### TASK 2.1 #####
##### TASK 2.2 #####


And you can also pick the same topic that you have picked before. So, my first question is what is your chosen topic? 

Pink Rabbit: Let's choose the same topic as before. “Implementation of database systems”. 


##### TASK 2.3 #####


investigator: Okay, perfect. So, we can skip question two of the familiarity with the topic and go directly to question three, which asks you to define relevancy.

Pink Rabbit: Would consist out of several things for me. First, I think articles are relevant, which are cited more often than other articles. Second, I would say articles are relevant if they have been written by experts on the domain. third, I would consider articles as relevant if they are published in journal or conferences, which are the big conferences or journals of the topic of the domain. yeah, I think these three tabs, three points would be the most important things to define relevancy for me. 


##### TASK 2.4 #####


investigator: Okay. Then we can go to question four. And how would you solve this task? How would you find relevant papers from your topic, which appeared after 2017?

Pink Rabbit: My general approach would probably be quite similar to task one, which means I would first use Google to understand on a very high level the concept a bit better. So, to understand what kinda blog posts are out there, what kinda articles out there? are there any books which are already recommended on the first Google page? And maybe get also some authors. I would try to understand if the authors are relevant for the topic, if they're experts on the topic in the same way as before. 

So, I would look for them on Google scholar and DBLP. once I have like these keywords and topics and people again, I would look in DBLP and Google scholar to understand where these people publicised their articles. I would also look if there are any conferences which come up quite often, if I look for the relevant keywords so I would focus on the articles there on the conferences there and general sources, the venues once I have like the relevant venues I would use venue search probably on DBLP if I want to go for individual venues otherwise I would use Google scholar. there, I would select only literature, only articles published in 2018 or later. and again, use the number of citation, the relative number of citations, to understand which ones are most relevant ones. Since we don't have the number in DBLP, there I would look for names, which I recognize from my previous search and for articles, which seem to be explicitly relevant to my specific topic, which I would be looking for. 

investigator: Okay. That's great. you mentioned in the beginning that you were using Google and then after for the keywords and the overview of the what's the area, and then that you would use Google scholar and DBLP for authors. So, would you do both at the same time or would you do it first, the Google part with the keywords and then the part with the authors? Or would you do it interleaved or, yeah?

Pink Rabbit: I would definitely start with Google and stay on Google for the first few minutes. I would open and skim a few web pages there, but not skip yet, jump yet to any of the academic research tools once I have like an rough overview, I would use both in parallel probably.

So, I would use Google and, DBLP and Google scholar more or less at the same time. once I feel I have a broad overview on the keywords I would stop using Google and only use DBLP and Google scholar. So, I would gradually switch from Google to the digital libraries. 

investigator: Okay. That's great. Yeah. Do you want to add anything else to this question?

Pink Rabbit: No, I'm happy. 


### Thank you ###


investigator: Okay, then thank you so much. I'm going to stop the recording.

